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1.  INTRODUCTION 


In  this  article  we  present  a  large  number  of  probability  density 
functions  from  20  different  families.  They  were  found  through  an 
extensive  search  of  the  statistical  literature,  with  the  intention  of 
listing  distributions  that  arise  in  practical  work.  The  purpose  of 
the  collection  was  to  use  the  exact  percentage  points  of  a  typical 
distribution  to  assess  the  accuracy  of  various  methods  of  approximating 
the  density  using  four  or  more  moments.  The  methods  of  approximation 
were  to  fit  Pearson  curves  or  Johnson  curves,  using  four  moments  in 
each  case,  or  to  fit  Cornish-Flsher  expansions,  using  four  or  more 
moments.  Thus ,  we  did  not  wish  to  use  members  of  the  Pearson  system 
themselves,  or  members  of  the  Johnson  system;  this  excludes  the  normal, 
chi-squared,  t,  F,  gamma  and  beta  distributions. 

For  the  purposes  of  the  study,  calculation  of  the  exact  significance 
points,  along  with  at  least  the  first  four  moments,  was  necessary. 

As  many  as  three  parameters  could  be  varied  for  several  of  the  families 
of  densities,  giving  rise  to  a  large  number  of  possible  individual 
distributions.  The  functions  of  the  moments  that  we  used  to  index  the 
distributions  are: 

2 

S1  m~  and  g2  U~1 

y2  y2 

where  are  the  central  moments  of  the  distribution.  We  chose  the  para¬ 
meters  to  cover  a  fairly  broad  area  of  the  82>  plane,  subject  to  the 

constraints  /ST  £  2  and  B2  £  14.  The  395  distributions,  from  20  different 
families  of  distributions,  are  displayed  in  Figure  1,  each  family  denoted  by 
a  different  letter  of  the  alphabet.  A  remarkable  feature  of  the  large  number 
of  distributions,  which  are  naturally  occurring  densities,  but  a  little  off 


Che  mainstream  of  statistical  work,  is  how  many  of  them  lie  either  on  the 
line  of  symmetry  ■  0,  or  are  close  to  the  chi-squared  line  (see  Figure  1) . 

The  results  of  the  comparison  of  approximations  will  be  presented 
in  a  later  report.  In  the  meantime,  this  collection  is  documented  as 
it  will  be  useful  to  other  workers  who  wish  to  find  representative 
distributions  with  given  values.  In  section  2  the  various 

families  and  types  of  distributions,  along  with  parameter  values  and 
(^6^,  Bj)  values,  are  described.  Numerical  methods  for  computation  of 
moments  and  cumulative  distribution  functions  are  discussed  in  section  3. 

2.  DESCRIPTION  OF  THE  DISTRIBUTIONS 

2.1  Noncentral  Chi- squared  Distributions  (denoted  by  'A'  on  Figure  1) 
v 

Let  X  *  E  (2,+d. )  2  where  the  Z.  are  i.i.d.  (independent  and 

i-1  1  1  1 

identically  distributed)  N(0,1)  and  the  d^  are  constants.  The 
distribution  of  X  depends  on  d^,  d2,  .  .  • »  dv  only  through 
v 

X  -  Ed.2  and  is  called  the  noncentral  chi-squared  distribution  with 
i-1 

v  degrees  of  freedom  and  non-centrality  parameter  X,  denoted  here 
by  x2(v,X).  Cumulants  of  all  order  exist,  with  k ■  -  2r_^(v+rX) (r-1) ! 
(Johnson  and  Kotz  (1970b,  p.  134)).  The  cdf  F(x)  was  evaluated  using 
the  algorithm  of  Shell  and  O'Muircheartaigh  (1977).  Parameters  of 
the  fifteen  x  ^v.X)  distributions  used  are  shown  in  Table  1. 


Table  1.  y*(v.X)  Distributions 


V 

X 

62 

V 

X 

62 

1.0 

1.44 

1.97 

8.39 

4.0 

1.96 

1.25 

5.27 

1.0 

1.96 

1.78 

7.38 

6.0 

6.76 

0.86 

4.04 

1.0 

7.84 

1.02 

4*40 

6.0 

19.36 

0.61 

3.50 

2.0 

5.76 

1.10 

4.64 

6.0 

31.36 

0.50 

3.33 

3.0 

0.16 

1.63 

6.96 

8.0 

6.76 

0.80 

3.91 

3.0 

3.24 

1.23 

5.13 

12.0 

6.76 

0.71 

3.72 

3.0 

7.84 

0.93 

4.18 

20.0 

36.00 

0.41 

3.23 

4.0 

0.04 

1.41 

6.00 

2. 2_ Noncentral  F  distributions  (denoted  by  'B'  on  Figure  1) 

Let  Tx  be  distributed  as  x2(vx,X),  be  distributed  as 
X  2(v2,0.0)  and  suppose  and  are  Independent.  Then 
X  “  (Y1/v1)/(Y2/v2)  is  said  to  have  a  noncentral  F-distribution  with 
V  v2  ^e8reea  °f  freedom  and  noncentrality  parameter  X,  denoted 
here  by  FCv^v^X).  The  first  four  central  moments,  given  by  Pearson 
and  Tiku  (1970),  who  set  l  -  X/v ,  are: 


v2(l+i) 

v2~2 


(v2  >  2) 


2v2  jVy2"2) 


2  v1(v2-2)2(v2-4) 


(v2  >  4); 


_  8v23(v14-v2-2)(2v14v2-2)  ^  ^  6vxi2  2vf*3  -| 

3  v.  2(v,-2)3(v„-4)(v„-6)  f+3it2v1+y2-2  +  (v  +V.-2) (2v.+v.-2)  f  (v2  >  6) 


12v2^(v14v2-2) 

4  Vi3(v2-2> 4 (v2-4) (v2~6) (v2«8) 


|{2(3v1+v2-2) (2v1+v2-2)+(v1+v2-2) (v2»2) (Vj+2) } (1+44)  + 

•  9  li  . 


2v1(3v]+2v2-4)  (v2+10)4  2+4vx  2(v2+10)4 


3  v13(v2+10)£4 
(Vj+Vj-*) 


(v2  >  8) 


Th«  exact  pdf  f(x)  (0  <  x  <  •)  and  cdf  F(x)  are  very  complicated 

and  are  not  given  here.  Lachenbruch  (1967)  gives  values  of  x  for 

a 

a  -  .01,  .025,  .05,  .10,  .5,  .90,  .95,  .975,  .99,  v±  -  1(1)  10,  15, 
20(10)60,  120,  v2  -  2(2)10(10)40,  60  and  X  -  1(1)10.  The  parameters 
and  significance  points  of  the  twenty-eight  F^.v^X)  distributions 
used  are  given  in  Table  2. 


2.3  Noncentral  t  distributions  (denoted  by  'C'  on  Figure  1 


Let  X  ■  (Z+6)//U/v,  where  Z  is  N(0,1),  U  is  central  chi-squared 
with  v  degrees  of  freedom,  6  is  a  constant  and  Z  and  U  are  independent. 
Then  X  has  the  noncentral  t  distribution  with  v  degrees  of  freedom  and 
noncentrality  parameter  6,  denoted  here  by  t(v,6).  Hogben,  Plnkham 
and  Wilk  (1961)  give  coefficients  to  find  the  first  four  central 
moments  of  t(v,6)  for  v  -  2(1)  25(5)50,  60,  70,  80,  90,  100,  150, 
200(100)1000.  These  appear  as  Table  28  in  Pearson  and  Hartley  (1972). 
Merrington  and  Pearson  (1958)  give  for  the  central  moments  of  t(v,6): 

u  •  S/v/2  r(>s(v-l))/r(v/2),  where  T(x)  ■  /*tx  ^e  Cdt; 

o 


'(l«V(v2)  -  u2  (v  >  2) ; 


w  Tv-TKv-3)  "  2v2  (v  >  3); 


2  2  4 

_  v  (3+66  +6  ) 

J4  (v-2) (v-4) 


» 

v(v+i)(5 
“  W  (v-2) 


<5  +3(3v-5) 
) (v-3) 


(v  >  4), 


Ic  should  also  be  noted  that  t(v,6)  has  range  C-0*,00). 


The  cdf  for  Integral  values  of  v  can  be  expressed  (Owen  (1962, 
p.  108))  as  a  finite  sum  involving  the  N(0,1)  pdf  and  cdf  and 
Owen’s  T-function  T(h,a)  (see  Owen  (1962,  p.  184)).  This  function  is 


defined  by: 


6 

‘•a) '  i 


dx  (-®<h,  a<®). 


The  algorithm  of  Cooper  (1968)  for  computing  Pr(t(v,6)  <  t),  was  used 
with  two  changes.  For  computing  the  N(0,1)  cdf  the  algorithm  of 
Hill  (1973)  was  used.  Also,  the  more  accurate  algorithm  of  Young  and 
Minder  (1974)  for  computing  T(h,a)  was  utilized  (see  also  Hill  (1978) 
and  Thomas  (1979)).  The  parameters  of  the  136  t(v,S)  distributions  used 
are  given  in  Table  3. 


dratic  Forms  (denoted  by  'O'  on  Figure  1 


Let  Zy  Z 2 ,  .  .  . ,  be  independent  N(0,1)  random  variables  and 
k  2 

let  Q(A,k)  «  Z  Z.  ,  where  X'  -  (X. ,X«, . . . ,X. )  is  a  vector  whose 

-  i«l  11  ~  1  *  K 

components  are  constants.  The  central  quadratic  form  Q(X,k) 

("central"  is  hereafter  omitted)  is  a  simple  weighted  sum  of  independent 
chi-squared  variables,  each  with  one  degree  of  freedom.  Cumulants  of 
all  orders  are  easily  computed  from  (Johnson  and  Kotz  (1970b,  p.  153)): 


k  -  Z1’1  (r-1) !  Z  X  r. 
r  i-1  1 


X)  Distributions 


Owen  (1962,  pp.  182-183,  205-206)  gives  cumulative  probabilities  and 

selected  significance  points  for  the  cases  k  *  2,3.  These  are  abridged 

versions  of  the  reuslts  in  Grad  and  Solomon  (1955)  and  Solomon  (1960) . 

Johnson  and  Kotz  (1968)  give  significance  points  xa  for  a-  .01,  .025,  .05 

.10,  .25,  .50,  .75,  .90,  .95,  .975,  .99,  .995,  k*4,5.  Owen  choses  X  so 

k  2  k  2 

that  £  X  «  1,  while  Johnson  and  Kotz  have  £  X,  *  k.  Several 
i-1  1  i-1  1 

additional  Q(X,k)  distributions  were  also  considered,  with  k  *  5,6,7, 

8,9,10  and  12.  Cumulative  probabilities  were  computed  using  the 

algorithm  of  Shell  and  O'Muircheartaigh  (1977).  Twenty  distributions 

from  the  Q(X,k)  family  were  used,  with  parameters  as  given  in  Table  5. 


2.5  z  Distributions  (denoted  by  'E'  on  Figure  1 


Let  z  -  j  in  F,  where  F  has  the  (central)  F  distribution  with 
v^^  degrees  of  freedom.  Although  z,  used  by  Fisher  (1924)  in  place 
of  F  for  purposes  of  approximation  and  tabulation,  is  a  random 
variable,  we  shall  adhere  to  the  convention  of  denoting  it  by  a  small 
letter.  This  distribution  will  be  noted  here  by  z(v^,v2).  Cumulants 
of  all  orders  are  finite  and  are  given  by  Johnson  and  Kotz  (1970b,  p.78) 
as  follows: 


*1*2  [in (v^Vj^  +  iKvj/2)  -  tp(v2/2)  ] ; 

<r  -  2"r  [^(r-1)(v1/2)  +  (-l)r  ^(r"1)(v2/2)],  (r  >  2). 

(S) 

Computation  of  ij)(x)  and  1 1>  (x) ,  the  digamma  function  and  its  deriv¬ 

atives,  is  discussed  in  section  3.1. 


5 

V 

6  S2 

V 

9.79 

10.22 

10.72 
11.29 
11.93 
12.60 
13.32 

6.02 

6.07 

6.16 

6.28 

6.43 

6.61 

6.81 

7.04 

7.29 

7.55 

7.83 

8.12 

8.42 

8.72 

9.02 

9.33 

9.63 

9.94 

10.23 

10.52 

10.81 

11.09 

11.36 

11.62 

11.87 

12.11 

12.34 

12.57 

12.78 

12.99 

13.19 

13.38 

13.56 

13.73 
5.04 
5.08 


5.14 

5.31 

5.42 

5.54 
5.67 

5.81 

5.96 
6.12 
6.62 
6.79 

6.96 

7.14 

7.31 
7.48 

7.81 

7.97 
8.13 

8.43 
8.58 
8.72 
8.99 
9.11 
9.35 
9.57 
9.77 

9.96 
10.22 
10.37 
10.51 
11.66 
12.05 
12.35 

4.52 

4.55 

4.63 
4.76 
4.91 
5.10 
5.30 

5.63 
5.75 

5.97 
6.20 

6.32 

6.43 


Percentage  points  and  cumulative  probabilities  of  the  F  distri- 


nl 


bution  can  be  used,  since  Pr(z(v^,v2)  <  z)  ■  Pr(F  <  exp(2z)),  where 
F  has  the  F  distribution  with  v1>v2  degrees  of  freedom.  Thirty-one 
z(vi,v2)  distributions  were  used,  with  parameters  as  given  in  Table  4 

Table  4 :  z(v,  .v„)  distributions 


1 

2 

0.85 

5.40 

3 

4 

0.21 

3.76 

5 

10 

0.31 

3.48 

1 

3 

1.13 

5.87 

3 

5 

0.35 

3.80 

5 

20 

0.49 

3.64 

1 

4 

1.25 

6.14 

3 

10 

0.63 

4.07 

6 

6 

0.00 

3.38 

1 

5 

1.31 

6.32 

3 

30 

0.82 

4.40 

10 

10 

0.00 

3.22 

1 

10 

1.43 

6.66 

3 

100 

0.89 

4.55 

10 

20 

0.20 

3.22 

1 

30 

1.50 

6.89 

4 

4 

0.00 

3.59 

10 

40 

0.32 

3.29 

2 

3 

0.38 

4.19 

K9 

10 

0.44 

3.69 

10 

100 

0.44 

3.37 

2 

5 

0.69 

4.47 

MM 

20 

0.61 

3.88 

20 

20 

0.00 

3.10 

2 

10 

0.92 

4.87 

Bfl 

40 

0.69 

4.02 

20 

40 

0.14 

3.11 

2 

20 

1.03 

5.12 

u 

100 

0.74 

4.12 

30 

40 

0.05 

3.06 

3 

3 

0.00 

3.81 

II 

o 


.■nr*K  ’r-' 


Table  5:  Q(X»k)  Distributions 


k 

\ 

*2 

3 

0.4 

0.3 

0.3 

1.68 

7.34 

4 

1.8 

1.5 

0.4 

0.3 

1.91 

8.68 

4 

1.5 

1.5 

0.8 

0.2 

1.74 

7.71 

4 

1.5 

1.0 

0.8 

0.7 

1.61 

7.20 

4 

1.2 

1.2 

1.2 

0.4 

1.57 

6.73 

4 

1.2 

1.2 

0.8 

0.8 

1.49 

6.44 

4 

1.2 

1.0 

0.9 

0.9 

1.45 

6.19 

5 

1.8 

1.8 

0.6 

0.5 

0.3 

1.77 

7.93 

5 

1.2 

1.2 

0.9 

0.9 

0.8 

1.32 

5.67 

5 

1.2 

1.0 

1.0 

0.9 

0.9 

1.29 

5.52 

5 

0.4 

0.2 

0.2 

0.1 

0.1 

1.75 

8.15 

6 

0.3 

0.3 

0.2 

0.1 

0.05 

0.05 

1.57 

6.89 

6 

0.4 

0.2 

0.1 

0.1 

0.1 

0.1 

1.83 

8.75 

7 

0.3 

0.2 

0.1 

0.1 

0.1 

0.1 

0.1 

1.48 

6.78 

8 

0.2 

0.2 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

1.19 

5.33 

8 

0.3 

0.3 

0.1 

0.1 

0.05 

0.05 

0.05 

0.05 

1.66 

7.47 

8 

0.4 

0.2 

0.1 

0.1 

0.05 

0.05 

0.05 

0.05 

1.91 

9.22 

9 

0.3 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.05 

0.05 

1.54 

7.35 

10 

0.2 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.05 

0.05 

1.11 

5.10 

12 

0.2 

0.1 

0.1 

0.1 

0.1 

0.1 

0.05 

0.05 

0.05 

0.05  0.05 

0.05 

1.14 

5.33 

2.6  Weibull  Distributions  (denoted  by  on  Figure  1) 

If  the  random  variable  X  has  pdf: 

f(x)  -  ca_1[ (x-0)/a]c  1  exp[-((x-0)/a)c  ]  0<x<«,  c,a>0, 

X  is  said  to  have  the  Weibull  distribution  with  parameters  c,  a  and  0 
(see  e.g.  Johnson  and  Kotz  (1970a,  p.  250)).  We  here  consider  the 
standard  form,  denoted  by  Welb  (c),  of  the  Weibull  distribution, 
obtained  by  setting  a-1,  0-0,  having  pdf: 

f (x)  -  cxc_1  exp(-xc)  0<x<-, 

and  cdf: 

F(x)  •  1  -  exp[-xC]  0<x<*. 

The  Welb  (c)  distribution  has  moments  about  the  origin  given  by: 

•  r(f  +  l). 

where  F (x)  -  [  t*”^  e-t  dt.  Computer  evaluation  of  the  gamma  function 

'o 

is  discussed  in  Section  3.1.  The  parameter  values  for  the  30  Welb  (c) 
distributions  used  are  displayed  in  Table  6. 


Table  6:  Welb  (c)  Distributions 


1.04 

1.89 

8.26 

1.60 

0.96 

4.04 

5.50 

0.32 

2.96 

1.08 

1.78 

7.64 

1.68 

0.88 

3.82 

0.37 

3.04 

1.10 

1.73 

7.36 

1.76 

0.81 

3.64 

7.00 

0.46 

3.19 

1.12 

1.69 

7.10 

1.90 

0.70 

3.38 

8.50 

0.56 

3.39 

1.16 

1.60 

6.64 

2.10 

0.57 

3.13 

10.00 

0.64 

3.57 

1.20 

1.52 

6.24 

2.20 

0.51 

3.04 

15.00 

0.79 

4.00 

1.24 

1.45 

5.88 

2.50 

0.36 

2.86 

20.00 

0.87 

4.27 

1.30 

1.35 

5.43 

3.00 

0.17 

2.73 

30.00 

0.95 

4.58 

1.40 

1.20 

4.84 

3.50 

0.03 

2.71 

50.00 

1.02 

4.88 

1.50 

1.07 

4.39 

4.00 

0.09 

2.75 

75.00 

1.06 

5.04 

2.7  Generalized  Logistic  Distributions  (denoted  by  'G1  on  FI: 


gure  1) 

If  the  random  variable  X  has  the  pdf: 

f(x)  -  p  exp(x)  [l+exp(x)]~^p+1^  p>l, 

X  has  the  generalized  logistic  distribution  with  parameter  p,  denoted 
by  GL(p).  This  distribution  was  studied  by  Dubey  (1969)  and  has  cdf: 

F(x)  -  1  -  (1+exp (x) )~P  -®<x<». 

Note  chat  the  logistic  distribution  is  obtained  by  setting  p  equal  to 
one.  The  distribution  has  finite  cumulants  of  all  orders  (see  Johnson 
and  Kotz  (1970b,  p.  18))  given  by: 

<1  -  <j/(l)  -  ^ Cp) ; 

•cr  -  *(r"1)  (1)  +  (-l)r  i|»(r"1)  (P),  (r>2) . 

Six  GL(p)  distributions  were  used,  with  parameters  as  given  in  Table  7. 

Table  7:  GL(p)  Distributions 


p 

*2 

1 

0.00 

4.20 

2 

0.58 

4.33 

3 

0.77 

4.59 

4 

0.87 

4.76 

7 

0.99 

5.01 

15 

1.07 

5.21 

2.8  Chi  Distributions  (denoted  by  *H*  on  Figure  1) 

The  random  variable  X  is  said  to  have  the  chi  distribution  with 
v  degrees  of  freedom,  denoted  by  x(v),  if  X2  is  (central)  chi-squared 
with  v  degrees  of  freedom.  Johnson  and  Welch  (1939)  give  formulae 
for  the  first  six  cumulants  of  x(v)»  as  follow: 


^  r((v+l)/2)/r(v/2); 


•c4  -  \  -  <2v-l)a  -  1.5  a2; 

*5  “  ^(-2^  +  3a2); 

•Cg  ■  2(2v-1)k4  +  3a  -  12(2v-l)a2  -  15a3; 
where  a  ■  1  -  x(v)  has  the  pdf: 

f(x)  -  [2v/2“1r(v/2)]_1  x*-1  exp (-  |  x2)  0<x<», 

and  cumulative  probabilities  can  be  obtained  from  cumulative  chi-square 
probabilities  using  Pr(x(v)<x)  -  <  x)  -  Pr(Y  <  x2),  where  Y  Is 

chl-squared  with  v  degrees  of  freedom.  The  parameters  of  the  five 
x(v)  distributions  used  are  shown  in  Table  8. 

Table  8:  y(v)  distributions 


V 

'h 

S2 

1 

1.00 

3.87 

2 

0.63 

3.25 

3 

0.49 

3.11 

14 

0.20 

3.00 

30 

0.13 

3.00 

2.9  EPF  Statistics  for  Goodness-of-Flt  (denoted  by  *1*  on  Figure  1) 

We  here  consider  the  asymptotic  distributions  of  the 
Cramdr-von  Mlses  statistic  W2,  The  Watson  statistic  U2  and  the 
Anderson-Darling  statistic  A2.  These  statistics  measure  the  discrepancy 
between  the  empirical  distribution  function  Fq(x)  and  the  hypothesized 
cdf  F(x),  where  F(x)  may  contain  unspecified  parameters.  Four  cases 


are  considered  here: 


Case  1.  F(x)  Is  Che  normal  distribution  with  o2  known,  u  estimated 
by  the  sample  mean  5c. 

Case  2.  F(x)  Is  the  normal  distribution  with  u  known,  o2 

estimated  by  —  £(x,-y)2. 

n  x 

Case  3.  F(x)  Is  the  normal  distribution  with  y  and  o2  estimated 

by  x  and  s2  ■  — — r  Z(x.-x)2. 

n-x  l 

Case  4.  F(x)  ■  1  -  exp(-dx),  x  >  0  (the  exponential  distribution) 
with  0  estimated  by  1/S. 

For  all  these  cases ,  the  asymptotic  distributions  can  be  expressed 
as  an  infinite  sum  of  weighted  chi-squared  variables  each  with  one  degree 
of  freedom. 

Exact  means  for  all  four  cases  of  W2,  U2  and  A2  and  exact 
variances  for  W2  and  U2  are  given  by  Stephens  (1976).  Weights  were 
obtained  from  Stephens  (1976)  and  the  method  of  Imhof  (1961)  was  used 
for  computing  cumulative  probabilities.  The  percentage  points  obtained 
were  compared  with  those  given  by  Durbin,  Knott  and  Taylor  (1975) , 
who  independently  used  the  same  method.  Higher  cumulants  were  cal¬ 
culated  from  the  weights,  as  In  section  2.4.  Five  distributions  were 
used:  W2,  case  1;  U2,  cases  2  and  4;  A2,  cases  1  and  3.  The  parameters 
and  significance  points  for  these  distributions  are  given  In  Table  9. 

Table  9:  Goodness-of-Fit  Statistics 


Statistic 

W2, 

case 

1 

u2. 

case 

2 

u2. 

case 

4 

A2, 

case 

1 

A2, 

case 

3 

1.85  8.72 

1.64  7.52 


0.85 

0.90 

0.95 

0.975 

0.99 

0.995 

0.9975 

0.1165 

0.1344 

0.1653 

0.1965 

0.2381 

0.2698 

0.3017 

0.1052 

0.1218 

0.1507 

0.1804 

0.2208 

0.2519 

0.2836 

0.1116 

0.1289 

0.1588 

0.1892 

0.2300 

0.2613 

0.2930 

0.7819 

0.8937 

1.0874 

1.2847 

1.5510 

1.7561 

1.9640 

0.5610 

0.6318 

0.7528 

0.8742 

1.0359 

1.1592 

1.2833 

2.10  Thickened  Range  from  a  Uniform  Distribution  (denoted  by  '  J*  on  Figure  1 


Let  X(2)'  X(n)  be  the  ordered  observations  from  a  sample 

of  size  n  from  the  uniform  distribution  with  pdf  f(x)  -  1,  0<x<l,  and 
let  the  "thickened''  range  be  defined  by  (David  (1970,  p.  146) : 


Wn  "  (X(n)"X(l))  +  (X(n-l)“X(2))  +  ***  +  (X(n-p+l)“X(p)) ’ 

where  p  -  (^] ,  the  greatest  integer  <  tt.  The  cdf  of  W 
t  £  n 

is  given  by  Stephens  (1972)): 

F(w  )  -  1  -  Z  i”n[ (K.+L. )<i-w  >n+nL.w  <i-w  >n”1] ,  0<w  ip, 

n  i  i  n  x  a  n  n 

where  <x>  ■  x  for  x  >  0,  <x>  -  0  for  x  <  0.  If  n  *  2p+l, 


P 

-2Li  Z 


r-1 

r*i 


i-r 


l»2,...,p. 


while  if  n  *  2p,  we  have: 


L  -  l 

1  1  r-1 

n*i 


£)• 


i  -  1*2, . . . , (p-1) ; 


L  -  0; 


K  -  -  L, 


P 

Z 

r-1 

n*i 


(4- 


i-p 


i  *  1,2, ... , (p— 1) ; 


K 


P“1  f  _  )z 

n  rM  • 

r-1  ^ 


Formulae  for  the  first  four  central  moments  of  Wq  are  similarly  given 
in  Stephens  (1972).  Six  Wq  distributions  were  used  in  this  study, 
with  parameters  given  in  Table  10. 
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Table  10:  W  distributions 


.24 

2.79 

9 

0.00 

2.83 

o 

o 

• 

2.63 

10 

0.04 

2.90 

.12 

2.86 

19 

0.00 

2.93 

2.11  Resultants  of  Random  Unit  Vectors  in  Three  Dimensions  (denoted  b 


K*  on  Figure  1) 

Suppose  n  unit  vectors  are  uniformly  distributed  in  three  dimensions; 
that  is.  If  Op^  is  a  typical  vector,  the  origin  is  fixed  at  0  and  p^ 
moves  uniformly  on  the  surface  of  the  sphere  with  center  0  and  radius  1. 
Let  X  be  the  vector  sum  of  the  n  vectors.  The  cdf  of  X  is  given  in 
Stephens  (1964) .  Cumulants  of  Z  *  X2  are  given  by  Solomon  and 
Stephens  (1975),  as  follows: 

-  n; 

2 

<2  -  -j(n2-n) ; 

<2  *  f  (n-2); 

k4  -  ||(5n4-30n3+52nz-27n). 

The  random  variable  X  has  range  (0,n)  and  approximate  significance 

points  x  of  X  are  related  to  those  for  Z  by  x  ■  /z~  .  This  follows 
a  a  a 

from  the  relationship  a  -  Pr(X<xfl)  ■  Pr(/z<xa)  ■  Pr(Z<xQ2).  Distributions 
of  vector  resultants,  denoted  by  RUV(n),  are  considered  for  9  values 
of  n,  with  parameters  given  In  Table  11. 


Table  11;  RUV(n)  distributions 


2.12  Extreme  Value  Distribution  (denoted  b 


The  random  variable  X  has  the  Type  I  extreme  value  distribution 


if  X  has  pdf: 

f(x)  ■  exp (-x  -  exp(-x)),  —<x«».  (Johnson  and  Kotz  1970a,  p.272). 


The  cdf  is  then: 


F(x)  -  exp(-  exp(-x)),  — <x<«*, 

and  the  cumulants  are  given  by: 

-  -i|»(l) ; 

ic  -  (-l)r  ♦(r”1)(l),  r>2.  (Johnson  and  Kotz  (1970a,  p.278) 

Here  we  are  considering  only  one  distribution,  with  ■  1.14  and 
82  -  5.40. 

2.13  Compound  Laplace  Distributions  (denoted  by  'M'  on  Fi 


The  random  variable  X  with  pdf: 

f(x)  -  |  (1  +  |x|r(o+1)  — <x<«,  a>0, 

has  the  (standardized)  compound  Laplace  distribution  (Johnson  and  Kotz 
(1970a,  p.32)).  This  distribution  will  be  denoted  here  by  L(a).  The 
L(a)  distributions  are  symmetric  about  zero  and  moments  of  order  a  or 
greater  do  not  exist.  For  r  even  and  less  than  a. 


u  ■  a 


E  (-1)^  [f] (a+1-r)”^  (Johnson  and  Kotz  (1970b,  p.32)). 


The  cdf,  as  given  by  Johnson  and  Kotz,  was  found  Co  be  Incorrect. 
The  correct  cdf  is: 

for  x<0,  F(x)  ■  |  j(l-t)”^a+1^  dt. 


t)-(a+1>  dt. 


l/,  ,  — ct 

-  y(l-x)  ; 


for  x>0,  F(x) 


-Wf 


(l+t)“(a+1)  dt. 


-  1  -  y(l+x)“a. 

The  parameters  of  the  26  L(a)  distributions  used  are  displayed  In 
Table  12. 

Table  12:  L(a)  Distributions 


12.97 

12.60 

11.96 

11.68 

11.20 

10.79 

10.44 

10.21 

10.00 

9.81 

9.64 


a 

S2 

11.4 

0.00 

9.44 

12.4 

0.00 

9.01 

13.0 

0.00 

8.80 

13.6 

0.00 

8.62 

14.4 

8.41 

15.4 

0.00 

8.19 

17.8 

0.00 

7.80 

19.5 

0.00 

7.60 

21.6 

0.00 

7.40 

24.4 

0.00 

7.20 

39.0 

0.00 

6.70 

52.0 

0.00 

6.51 

100.0 

0.00 

6.25 

2.14  Subbotin  Distributions  (denoted  by  'N  on  Figure  1) 

The  random  variable  X  has  a  Subbotin  dlatrlbutlon  with 
parameter  6,  denoted  by  S(5),  If  X  has  pdf: 


f(x)  -  [25/2+1r(f»-l)]-1  exp(4|x|2/6).  — <x<-,  6>0. 


2‘*/j  2 1 

This  Is  the  standardized  form  of  the  pdf  given  by  Johnson  and  Rotz 
(1970b,  p.33).  This  distribution  is  symmetric  about  zero  and  has 
finite  moments  of  all  positive  orders,  with 


2*sr6r (fcffi-5-) / r 4)  (r  even) . 


^  —  -v  2  '  9  '2 

In  order  to  make  percentage  point  comparisons,  the  cdf  of  S(6)  was 


derived,  as  follows: 


U. 


for  x>0,  F(r)  -  k  I  exp(-|t2/S)dt,  k  -  [22  rcl+l)]'1, 

.  I  +  k  j*  erp(-i  t2/5)dt, 

1  2/J  6  . 


^1 

1  2  A 
j  +  2Z  «k 


e“y  dy. 


1  +  ip  A  12/6 

2  +  2  F'2'  2  X  '* 


where  P(a,x)  -  [r(a)]_1J  ta_1  e-t  dt  is  the  incomplete  gamma  function 

0 

ratio  (see  e.g.  Abramowitz  and  Stegun  (1965,  p.260)>. 

For  x<0,  F(x)  -  1  -  F(-x). 

Thirteen  S(6)  distributions  were  used,  with  parameters  as  shown  in 


Table  13. 


'O'  on  Figure  1 


The  random  variable  X  has  Che  hyperbolic  secant  distribution 
(Johnson  and  Kotz  (1970b,  pp. 15-16))  if  X  has  the  pdf: 

f(x)  -  ir  1  sech  (x),  -®<x<“. 

This  distribution  has  cdf: 

F(x)  ■  +  it  ^  tan  ^(sinh  (x))  -«xx<». 

(Note  that  the  formula  given  for  F(x)  in  Johnson  and  Kotz  (1970b, 
p.15)  is  in  error).  The  distribution  is  symmetric  about  zero  and 
central  moments  of  even  order  are  given  by: 


yr  "  7  F(r  +  X)  B(r  +  1}* 


where  the  function  6(n)  -  Z  (-l)k  (2k  +  l)_n  is  tabulated  in 

k-0 

Abramowitz  and  Stegun  (1965,  p.812)  to  18  decimal  places  for 
n  «  1(1)38.  For  this  distribution,  /(TJ-O,  82“5. 


2.16  Laplace  Distribution  (denoted  by  'P'  on  Figure  1 


The  random  variable  X  has  the  standard  form  of  the  Laplace 
distribution  (also  known  as  the  double  exponential  distribution)  if: 

f(x)  “  -r  exp  (— | x | ) ;  -«<x<». 


The  cdf  of  this  distribution  is: 


F(x)  -  -  exp(x)  x  <  0, 

1  -  y  exp(-x)  x  >  0. 

The  distribution  is  symmetric  about  zero  and  central  moments  of  even 
order  are  given  by: 

ur  ■  r!  (Johnson  and  Kotz  (1970b,  p.23)). 


Thus  »  0  and  “  *>• 


2.17  Cosine  Distribution  (denoted  b 


on  Flsmre  1 


The  random  variable  X  has  the  cosine  distribution  if  X  has  the 


f(x)  ”  (2ir)  (1  +  cos  x) ,  -ir  <  x  <  ir. 

The  distribution  was  studied  by  Raab  and  Green  (1961)  who  suggest 
it  as  a  possible  substitute  for  the  normal  distribution.  The  cdf  is 


given  by: 


F(x)  -  y  +  (2tr)  1  (x  +  sin  x) ,  -tt  <  x  <  it. 


This  distribution  is  symmetric  about  zero,  odd-order  moments  are  zero 


and,  for  r  even: 


y;  -  E(X  )  -  (2ir) 


-1  r  (»r  ♦  *r 

*  — ir 


cos  x)  dx. 


TTr  ,,  v-1  fw  r 

m  +  <2*>  * 

-IT 


cos  x  dx. 


Since  j  xn  cos  x  dx  ■  xn  sin  x  +  n  xn  ^  cos  x  *  n(n-l)  j  x11 

■it 

cos  x  dx,  and  using  x2  cos  x  dx  *  -4it,  moments  can  be  easily  computed 

-IT 

for  r  ■  2,  4,  6 .  successively.  For  this  distribution,  *  0  and 


2.18  Generalized  Gamma  Distributions  (denoted  by  ’R*  on  Figure  1 


The  random  variable  X  has  the  generalized  gamma  distribution 
(standard  form)  if  X  has  pdf: 

f(x)  -  |p|[T(v)]  1  xpv-1  exp(-xP),  0  <  x  <  “,  p  #  0,  v  >  0. 

This  distribution  was  studied  by  Stacy  and  Mihram  (1965)  and  will  be 
denoted  here  by  GG(p,v).  The  Weibull  and  chi  distributions  used  in 
this  study,  as  well  as  exponential,  gamma  and  chi-squared  distributions, 
are  special  cases  of  this  distribution.  Moments  are  obtainable  from: 

u;  -  E(Xr)  -  r[(pv  +  r)/p]/r(v)  £  >  -v. 

Cumulative  probabilities  can  be  computed  from  the  cdf: 


F(x)  -  PW.xP) 


p  >  0, 


1  -  P(v,xp)  p  <  0, 

where  P(a,x)  is  the  incomplete  gamma  function  ratio,  defined  in  section  2.14 
Thirty-six  GG(p,v)  distributions  were  used,  with  parameter  values  given 


in  Table  14. 


Table  14:  GG(p,v)  distributions 


I» 


[»7» 


As  an  alternative  to  fitting  a  theoretical  pdf  to  data  and 
Integrating  to  obtain  cumulative  probabilities.  Burr  (1942)  suggests 
fitting  a  theoretical  cdf.  Among  several  possibilities,  he  considers 
the  cdf: 

F(x)  ■  1  -  (1  +  xC)”k,  0  <  x  <  ®,  c,k  >  1. 

This  distribution,  denoted  here  by  Burr  (c,k),  has  pdf: 

f(x)  -  ck  xc"1(l  +  xC)"(k+1)  0  <  x  <  ». 

Burr  (1942)  considers  cumulative  moments,  ,  defined  by: 

M j  "  |  x^(l  -  F (x) ) dx  -  j  x^F(x)dx. 

For  the  Burr  (c,k)  distribution,  M.  ■  r(^%T(k  -  ^%/[cr(k)], 

j  c  c 

j  <  ck  -  1.  The  first  four  central  moments  can  then  be  obtained 
using  the  relations  (Burr  (1942,  p.224)): 

M  -  Mq; 

“2  "  “l  *  M0; 

u3  -  3M2  +  2M3; 

U4  -  4M3  -  12M2Mq  +  12M1M2  -  3Mg. 

Twenty  Burr  (c,k)  distributions  are  used  here,  with  parameters  displayed 
in  Table  15. 

Table  15:  Burr  (c.k)  distributions 


1.91 

12.46 

6 

4 

0.02 

3.17 

1.59 

10.81 

7 

1 

1.46 

10.36 

0.92 

5.13 

7 

8 

0.30 

3.14 

0.48 

3.38 

8 

1 

1.22 

8.34 

0.96 

5.94 

8 

2 

0.19 

3.74 

0.51 

3.87 

9 

1 

1.06 

7.22 

0.64 

4.63 

9 

2 

0.11 

3.67 

0.12 

3.19 

0.94 

6.51 

0.43 

4.11 

2 

0.04 

3.65 

0.12 

3.36 

3 

0.21 

3.42 

Let  Y  have  a  gamma  distribution  with  pdf: 

f(y)  -  rrccor1  y0"1  e‘y  0  <  y  <  a  >  0, 
and  let  X  ■  in  Y.  It  can  easily  be  shown  that  X  has  pdf: 

f(x)  -  [r(a)]”1  exp (ax)  exp (-exp (x)) ,  —•  <  x  <  •. 

This  distribution  was  considered  by  Olshen  (1938) .  To  evaluate  the 
moments  of  X,  with  distribution  denoted  by  LG(ct) ,  we  first  obtain  the 
moment-generating  function: 

M^t)  -  E[exp(tX)  ] , 

-  [r(a)]”1  |*  exp[x(a  +  t)]  exp(-exp(x)dx, 

■  tr(a)]'1  za+t-1  exp(-z)dz,  where  z  «_exp(x), 

J  n 


-  r(a  +  t)/r(o). 

The  cumulant-generating  function  is  then: 

K  (t)  -  £n[M (t) 3 , 

X  X 

*  ini’ (a  +  t)  -  ini’ (a). 

Thus,  the  LG (a)  distribution  has  cumulants  given  by: 

-  ^(a); 

ic  -  (o);  r  >  2. 

Cumulative  probabilities  can  be  evaluated  using  the  relationship 
Pr(X  <  x)  *  Pr(£n(Y)  <  x)  -  Pr(Y  <  exp(x)),  where  Y  has  the  above 
gamma  distribution.  Five  LG (a)  distributions  were  used  in  this  study 
with  parameters  as  displayed  in  Table  16. 

Table  16:  LG (a)  Distributions 


a 

’'h 

S2 

2.5 

0.69 

3.93 

3.0 

0.62 

3.76 

4.5 

0.50 

3.49 

7.0 

0.39 

3.31 

10.5 

0.32 

3.20 

3.  COMPUTATIONAL  METHODS 


3.1  Computation  of  Moments  and  Cumulative  Distribution  Functions 

In  addition  to  the  computational  algorithms  described  in  Section  2 
evaluation  of  several  auxiliary  mathematical  functions  was  required  for 
the  accurate  computation  of  moments  and  cumulative  distribution 

r  x-i  -t 

functions.  The  function  r(x)  ■  t  e  dt  was  evaluated  using 

Jo 

the  algorithm  of  Pike  and  Hill  (1966) .  This  function  is  necessary 
in  computing  the  moments  of  the  noncentral  t,  Weibull,  chi,  Subbotin, 
hyperbolic  secant,  generalized  gamma  and  Burr  distributions. 

For  computing  the  moments  of  the  z,  generalized  logistic,  extreme 
value  and  log-gamma  distributions,  evaluation  of  the  derivatives  of 
£n[r(x) ]  was  required.  The  digamma  (psi)  function  is  defined  by 

3 

iKx)  -  ~  Un(r(x)]>  -  r*  (x)/r(x).  Similarly,  *(s)(x)  -  ^  {^(x)} 
is  called  the  trigamma,  tetragamma,  pentagamma,  hexagamma ,  ... 
function  for  s  ■  1,2, 3, 4,...  For  the  distributions  described  here, 

<Kx)  Is  required  for  integer  and  half-integer  values  only;  thus  the 
following  formulae  (Abramowltz  and  Stegun  (1965,  p.258))  are 
sufficient: 

il>(l)  -  -y; 

Q_1  -1 

<l»(n)  -  -y+  1  k  ,  n-2,3,4,...  ; 

k-1 

<Ky)  -  -y  -  2  in  2  ; 

(n+|-)  «  -Y  -  2  In  2  +  2  (1  +  +  ...  +  .  n*l  ,2,3,... 

Here  y  ■  0.5772156649...  is  Euler's  constant,  given  to  25  decimal 
places  in  Abramowltz  and  Stegun  (1965,  p.3). 


Arbitrary  derivatives  of  t|i(x)  required  here  for  integral  values  of 


x  only,  were  computed  using  the  following  formula  from  Abramowitz 
and  Stegun  (1965,  p.260): 

^(m)  (i)  m  ?  (mfl),  m-1,2,3,...  ; 

<|/(n)(n+l)  -  (-l)mml  [-?(nH-l)  +  1  +  ...  +  ^-],  n-1,2,3 

m-1,2,3 

“  -n 

The  Riemann  zeta  function  g(n)  *  E  k  is  tabulated  for  n**2(l)42  in 

k«l 

Abramowitz  and  Stegun  (1965,  p.811). 

-I  fX  a-1  -t 

The  incomplete  gamma  function  ratio,  P(a,x)  -  [T(a>]  t  "e  dt, 

J0 

required  for  the  computation  of  the  Subbotin  and  generalized  gamma 
cumulative  distribution  functions,  was  evaluated  using  the  algorithm 
of  Bhattacharjee  (1970).  Chi-squared  and  gamma  probabilities,  required 
for  computation  of  chi  and  log-gamma  cumulative  probabilities,  were 
also  computed  using  this  algorithm.  The  algorithm  of  Kajumder  and 
Bhattacharjee  (1973)  for  computing  Ix(a,b),  the  incomplete  beta 
function  ratio,  was  used  for  computing  cumulative  probabilities  of  the 
z  distribution. 

3.2  Interpolation  for  Cumulative  Probabilities 

Although  tabulated  significance  points  are  available  for  the 
noncentral  F  distributions  and  the  goodness-of-flt  statistics,  direct 
computation  of  cumulative  probabilities  is  difficult.  Thus  inverse 
Interpolation  for  cumulative  probabilities  is  required.  Given  m  values 
ct^,  in  ascending  order,  and  the  corresponding  significance  points  x^, 
a,  the  approximate  value  of  a  corresponding  to  an  intermediate  value  x, 
can  be  computed  using  the  n-point  Lagrangian  interpolation  formula 
for  unequally  spaced  abscissa  values: 


-29- 


k+n-1 

a  -  Z  lt  (x)*a 
i-k 


(x-Xj^)  (x-x2) . . .  (x-xi_1>  (x-*i+1>  •  •  •  (X”*^)  (x“xn) 


where  l±  (x)  -  (,x±-t^)  . . .  (Xj-x^)  (Xj-x^j) . . .  (Xj-x^j) 

Here  k  is  chosen  to  determine  which  known  values  will  be  used  in  the 
Interpolation;  e.g. ,  if  n-4,  k  might  be  chosen  so  that  two  values  of 
x^  lie  on  either  side  of  x. 

Since  Interpolation  using  the  above  formula  was  found  to  be 
occasionally  inaccurate,  regardless  of  the  values  chosen  for  n  and  k, 
the  work  of  Pearson  (1968)  (see  also  Pearson  and  Hartley  (1972,  pp. 139-141) 
was  consulted.  For  the  Inverse  problem  of  finding  x  for  a  given  a, 
where  the  values  are  known  at  standard  significance  levels  x^, 

Pearson  suggested  use  of  the  logit  transformation  y^  ■  in(a^/(l-a^)) . 

He  concluded  that  the  logit  transformation,  in  conjunction  with  the 
Lagrangian  interpolation  formula,  led  to  quite  accurate  results. 

This  method  was  applied  here  to  the  problem  of  determining  a  for 


k+n-1 

a  given  x.  The  formula  y  -  Z  l. (x)  y.  was  used;  then  a  was  determined 

i-k  1 

by  a  -  eY/(l+eY).  The  case  n-4  was  found  to  be  sufficiently  accurate; 
where  possible,  k  was  chosen  so  that  two  of  the  x^'  s  were  less  than 
x  and  two  greater  than  x. 
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